首页> 外文OA文献 >Semantic Perceptual Image Compression using Deep Convolution Networks
【2h】

Semantic Perceptual Image Compression using Deep Convolution Networks

机译:使用深度卷积网络的语义感知图像压缩

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

It has long been considered a significant problem to improve the visualquality of lossy image and video compression. Recent advances in computingpower together with the availability of large training data sets has increasedinterest in the application of deep learning cnns to address image recognitionand image processing tasks. Here, we present a powerful cnn tailored to thespecific task of semantic image understanding to achieve higher visual qualityin lossy compression. A modest increase in complexity is incorporated to theencoder which allows a standard, off-the-shelf jpeg decoder to be used. Whilejpeg encoding may be optimized for generic images, the process is ultimatelyunaware of the specific content of the image to be compressed. Our techniquemakes jpeg content-aware by designing and training a model to identify multiplesemantic regions in a given image. Unlike object detection techniques, ourmodel does not require labeling of object positions and is able to identifyobjects in a single pass. We present a new cnn architecture directedspecifically to image compression, which generates a map that highlightssemantically-salient regions so that they can be encoded at higher quality ascompared to background regions. By adding a complete set of features for everyclass, and then taking a threshold over the sum of all feature activations, wegenerate a map that highlights semantically-salient regions so that they can beencoded at a better quality compared to background regions. Experiments arepresented on the Kodak PhotoCD dataset and the MIT Saliency Benchmark dataset,in which our algorithm achieves higher visual quality for the same compressedsize.
机译:长期以来,人们一直认为改善有损图像和视频压缩的视觉质量是一个重大问题。计算能力的最新进展以及大型训练数据集的可用性已引起人们对深度学习CN应用于图像识别和图像处理任务的兴趣。在这里,我们提出了一个强大的cnn,专门针对语义图像理解的特定任务而设计,以在有损压缩中实现更高的视觉质量。编码器并入了复杂度的适度增加,从而允许使用标准的现成jpeg解码器。尽管可以针对通用图像优化jpeg编码,但是该过程最终未意识到要压缩的图像的特定内容。我们的技术通过设计和训练一个模型来识别给定图像中的多个语义区域,从而使jpeg内容感知。与对象检测技术不同,我们的模型不需要标记对象位置,并且能够在一次通过中识别出对象。我们提出了一种专门针对图像压缩的新cnn架构,该架构会生成一个突出显示突出显着区域的映射,以便可以与背景区域相比以更高的质量对其进行编码。通过为每个类添加完整的功能集,然后对所有功能激活的总和取阈值,我们生成了一个地图,该地图突出显示了语义显着的区域,因此与背景区域相比,它们的编码质量更高。在Kodak PhotoCD数据集和MIT Saliency Benchmark数据集上进行了实验,在相同的压缩尺寸下,我们的算法可实现更高的视觉质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号